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Abstract 



Many studies in Economics and other disciplines have been re- 
porting distributions following power-law behavior (i.e distributions 
of incomes (Pareto's law), city sizes (Zipf's law), frequencies of words 
in long sequences of text etc.)[l, 6, 7]. This widespread observed reg- 
ularity has been explained in many ways: generalized Lotka-Volterra 
(GLV) equations, self-organized criticality and highly optimized tol- 
erance [2,3,4]. The evolution of the phenomena exhibiting power-law 
behavior is often considered to involve a varying, but size independent, 
proportional growth rate, which mathematically can be modeled by 
geometric Brownian motion (GBM) dXt = rtXfdt + aXfWt where Wt 
is white noise or the increment of a Wiener process. It is the pri- 
mary purpose of this article to study both the upper tail and lower 
tail of the distribution following the geometric Brownian motion and 
to correlate this study with recent results showing the emergence of 
power-law behavior from heterogeneous interacting agents [5]. The re- 
sult is the explanation for the appearance of similar properties across 
a wide range of applications. 



1 Introduction to power law distributions 

This paper stems from the curiosity of the author to peer into the ap- 
parent similarities in the distribution of quite diverse phenomena, occurring 
in Social and Natural Sciences and usually explained by the use of skewed 
distributions. Scholars and scientists are used to measure variables pertinent 
to their fields, associating to each measure a particular scale. Often, as in 
the case of individual wealth across a population, the measurement can span 
several order of magnitude depending on the chosen individual. Similarly, 
cities size or firms size can present values distributed across multiple scales. 
In all these cases, the distribution representing the sample under investiga- 
tion appears right skewed, i.e. while the majority of the measurements lie at 
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lower level, there are quite a few measurements that are plotted well to the 
right that the typical value, creating a tail in the distribution. As a curiosity, 
we notice that in Economics this tail goes by the name of fat tails, in Physics 
it is referred to as critical fluctuations, in Computer Science and Biology it 
is the edge of chaos, and in Demographics and Linguistics it is called Zipf's 
law. 

It has been reported in Economics since 1897 by Pareto that the distri- 
bution of personal income possesses, after a certain threshold, a heavy-tail 
following a power law (i.e. ~ x~ a ) [14]. In his work, Pareto indicated a 
value for the exponent a of about 1.5. The same behavior has been reported 
more recently in other disciplines. For example, the Harvard linguist Zipf 
discovered, in 1949, analyzing the frequencies of words in long sentences of 
text that they were distributed according to a power law with a value of 
the exponent a equal to one [7]. In this respect, Zipf's distribution (also 
known as zeta distribution) can be considered the discrete counterpart of the 
continuous Pareto's distribution. 

Today, we are aware that many other phenomena have been reported 
following power laws with various values of the exponent a like clusters of 
Bose-Einstein condensate near absolute zero (following Pareto's law) [16], city 
sizes (following Zipf's law), the number of hits on a website, just to mention 
few cases [1,3, 6, 23]. 

Similar analysis for the distribution of income have showed in recent years 
a value of the exponent close to 1.4 (Pareto' law), for the case of Great Britain 
distribution of wealth [15], and to 1.0 (Zipf's law), for the case of American 
firm size as measured by receipts and employees [1]. A different result was 
obtained by Stanley et al., who reported that American firm size followed a 
lognormal distribution instead of a power law distribution [17]. 

Lognormal distributions were advanced, probably in the first real attempt 
to tackle this problem from a theoretical standpoint, by Gibrat in 1931. 
Gibrat first observed that assuming that the growth rate of each firm was 
independent of its size (Gibrat's law in weak form), the distribution of firms 
size should have been right skewed and, if the rates of growth were only 
moderately correlated, such distribution should have been a member of the 
lognormal family (Gibrat's law in strong form) [18]. 

From an empirical point of view, it is known that growth rates of firms' 
output and countries GDP follow exponential densities or, as they are also 
known, Laplace distributions (i.e. ~ aexp[— ax], x > 0) [19,20]. This fact 
would confirm inherently Gibrat's law in weak form. The strong form of 
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Gibrat's law is confirmed only by the study of Stanley et al. but is not 
confirmed by several other analysis starting from Pareto and ending, more 
recently, with Axtell. This paper aims at fixing these apparent inconsisten- 
cies of the theory with the associated empirical findings and at putting in 
perspective some triumphalism about natural laws. 

2 Explanations of power law distributions 

An interesting quest for explaining the reasons of this apparent similarities 
is currently on-going and involves several scholars and researchers, all united 
in a multidisciplinary effort. This widespread observed regularity has been 
explained in many ways: Generalized Lotka-Volterra (GLV) equations, self- 
organized criticality (SOC) and highly optimized tolerance. 

GLV equations involve three scalar parameters and a probability distribu- 
tion, each having definite roles in the model's interpretation. These equations 
are suitable for multi-agents simulations. The first and second term of the 
equations are typically the auto-catalytic terms. They are, respectively, the 
growth rate of the system that is proportional to the size of the variable un- 
der investigation (i.e. individual wealth, city size etc.) and to the arithmetic 
mean of the same variable across the system, while the last term represents 
the mean-field interaction and is to be considered a competing term. The 
balance of the auto-catalytic terms with the competing term determines dy- 
namical equilibrium in the system and power law distributions [2, 3]. 

If the system is far from equilibrium, self-organizing phenomena and a 
state of self-organized criticality (SOC) may occur. According to the notion 
of SOC [21], scaling emerges because the sub-units of a system are heteroge- 
neous and interact, and this leads to a critical state without any attractive 
points or states. The occurrence of a power law may be read as a symptom 
of self-organizing processes at work. A notable example of this approach 
applied to macroeconomics is the inventory and production model developed 
by Bak et al. [22]. 

The classic example of highly optimized tolerance can be explained through 
a model of forest fires. Suppose that fires start at random in a grid-like forest 
and that, instead of appearing at random, trees are deliberately planted by a 
knowledgeable farmer. One can ask what the best distribution of trees is to 
optimize the amount of lumber the forest produces, subject to random fires 
that could start at any place. The answer turns out to be that one should 
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plant trees in blocks, with narrow firebreaks between them to prevent fires 
from spreading. Moreover, one should make the blocks smaller in regions 
where fires start more often and larger where fires are rare. The reason for 
this is that we waste some valuable space by making firebreaks, space in 
which we could have planted more trees. If fires are rare, then on average 
it pays to put the breaks further apart so that more trees will burn if there 
is a fire, but we also get more lumber if the event do not occur. Carlson 
and Doyle show both by analytic arguments and by numerical simulations 
that, for quite general distributions of starting points for fires, this process 
leads to a distribution of fire sizes that approximately follows a power law. 
The distribution is not a perfect power law, in this case. Carlson and Doyle 
have proposed that highly optimized tolerance could be a model not only for 
forest fires but also for the sizes of files on the world wide web, which appear 
to follow a power law [4, 24, 25] 

Before starting our journey, in order to temper the enthusiasm behind 
power-law distributions, which are described too often as able to represent 
almost every complex phenomena, we would like to remind us the caveat 
clearly expressed by Prof. Feller commenting the good fit of the logistic dis- 
tribution function with real life phenomena "Lengthy tables, complete with 
chi-square tests, supported [the] thesis [that a logistic distribution function 
could be representative] for human populations, for bacterial colonies, de- 
velopment of railroads, etc. Both height and weight of plants and animals 
were found to follow the logistic law even though it is theoretically clear that 
these two variables cannot be subject to the same distribution. [...] The only 
trouble with the theory is that not only the logistic distribution but also 
the normal, the Cauchy and other distributions can be fitted to the same 
material with the same or better goodness of fit." [8,9] 



3 The stochastic differential equation GBM 

Since the phenomena we are going to investigate have a varying, but size 
independent, proportional growth rate, they are described by the following 
stochastic differential equation (GBM) 

f - <** w 

where at = r t + aWt so that r t = r represents the constant growth rate 
to which we have added white noise disturbance W t or the increment of a 
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Wiener process. As we will see, this stochastic equation does not allow for 
power law distributions but only for lognormal distribution unless we revisit 
our understanding of the concept of time. 

The way W t is typically constructed is as a probability measure on the 
space of tempered distributions on [0, oo) and not as a probability measure 
on the much smaller space i?[° ,00 \ like an ordinary process can. W t has 
stationary independent increments and is equivalent to a Brownian motion 
B t following a normal distribution with mean and variance t [10,11,12,13]. 

Let us make use of the Ito calculus and formally replace W t by dB t dt in 
equation [1]. We then obtain 

dX t = r t X t dt + aX t dB t . (2) 
The solution to this equation is 



L 



*dX t 



r t t + adB t , (3) 



where we have assumed B = 0. 

The integral on the left side must be evaluated using the Ito's formula for 
stochastic integrals so that 



d ^X t = ^dX t + \(-^)^-\ a H t (4) 
and, after a final integration, 



lnX t = lnX + (r- ^a 2 )t + aB t . (5) 

Equation (5) is the functional form that links the two variables X t and 
B t . We can easily proceed on deriving the distribution of lnpT t ) from that 
one of B t) after a certain threshold value T. 

It is a trite calculation to obtain 



lnXr -> N 



lnX + (r- \a 2 )T, a 2 T 



(6) 



where represents the normal distribution. We have then proved that GBM 
can provide only lognormal distributions. 

Let us assume in a way similar to what has been presented by Reed [26] 
that the threshold value T is a random variable itself, distributed according 
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to an exponential law or Laplace law, fr(t) = v exp[—ut],t > 0. In a certain 
sense, it is like saying that the evolution of the system (to fix the ideas let 
us take firms size) follows the GBM for an interval of time, which is itself a 
random variable and can be related to the time of observation. 

With this assumption, we have that the moments generating function of 
the variable \nX T and fa are respectively 



E[e hlXT{t)s ] = exp j(lnXos) 



/ 1 2\ 1 2 2 

r a )s H — a s 

2 ; 2 



(7) 



and 

E[e^ s ] = (8) 

where E represents the functional of the mean value operator. 
It is an easy derivation, following Reed [26], to find that the state after 
time T, indicated by a tilde, lnX r is no longer lognormally distributed. 
In fact, we have 

up ^X s 

E T [E(e hlXTS \T)] = = } ^ (9) 

{v+ [{r-\a 2 )s + \a 2 s*\} 

Let us denote the two positive solutions mi and -m2 of the quadratic 
equation ^a 2 s 2 — (r — \a 2 )s — v = so that we can write 

E T [E(e l ^ s \T)] = e lnXos - ^ (10) 

(mi-s)(m 2 + s) 

It can be verified that the former equation is the moments generating 
function of the asymmetric Laplace distribution centered on In X with prob- 
ability density function 

W^^^- 1 "^^ (id 
/ ^» K) = ^T^ e ""' K "°*''^ 1,lA '» (12) 

From the previous distributions, we can deduce that X has the following 
distribution functions 
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showing power law behavior in both tails (Reed refers to this distribution by 
the expression double- Pareto distribution [26]). 



4 Analysis of the double-Pareto distribution 

An original part of our paper is represented by the exploration of the 
possible values of mi and m 2 and of their determinants. 

Solving the quadratic equation \a 2 s 2 — (r — \a 2 )s — v = 0, we obtain 



1 + 



(2r - a 2 f 



(15) 



m 2 = 



or 



1 + 



(16) 



Let us start considering the limits 

(2r - a 2 ) 



lim mi 



cr 

lim m 2 = 



(17) 
(18) 



and 



lim m x = oo 

v — »oo 

lim m 2 = — oo 



(19) 
(20) 



Few considerations are in order. First of all, the case in which v gets 
smaller means that the time between two consecutive evolutions according 
to the GBM model gets longer and, therefore, the phenomenon turns to be 
exclusively described by a lognormal distribution. In fact, it is more correct 
to affirm that the distribution is no more a power law but rather a lognormal 
distribution, even if we save only one of the two solutions that can make 
sense for certain r and a. When v gets larger, the model explodes, providing 
no further predicting power. 
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Let us continue estimating the following limits that will provide enough 
information to plot these functions in terms of a (see figure 1) 



and 



lim mi = oo, provided r > (21) 
limm 2 = -- (22) 



lim mi = +\ — (23) 
V r 



lim m 1 = - J- (24) 

lim m 2 = — \ — (25) 
«^^+ V r 

/U 

lim m 2 — +\ — (26) 



and, finally, 



lim mi = —1 (27) 

a— >oo 

lim m 2 = (28) 



It is quite interesting to see that a* = y2r separates two kinds of solu- 
tions: to the left of this point mi is positive and m 2 is negative, while to 
the right of this point the opposite occurs. Considering the particular sym- 
metry of the solutions, we can interchange mi with m 2 . We will say, then, 
that a+ is defined as the natural degree of stochasticity of the system and is 
proportional to the square root of the growth rate of the system. 

Another point that deserves our attention is the role played by the ratio 
between v and r, which are both the inverse of time. Essentially, these 
parameters measure respectively the time scale of the sampling of our system 
and of its growth rate. Based upon real life considerations, we can assume 
that the latter one is much greater than the former one and, therefore, put 
u/r « 1. 

This results can be translated into two separate regimes: 
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Figure 1: Plot of mi and m 2 as a function of a. 

1. for a < a* 

?7ii G (^Jujr ~ 0, oo), m2 G { — \j~u~fr ~ 0, —vjr = 0) (29) 

2. for a > a* 

mi G {—\fvjr w 0,-1), m2 G (ijvjr 0, 0) (30) 

Considering that a* can be associated to the natural degree of stochas- 
ticity of the system that is proportional to the square root of the growth rate 
of the system, while a can be referred to the internal degree of stochasticity 
of the system, we will then separate between two kinds of system: quasi- 
stochastic systems, defined by equation (29), and stochastic systems, defined 
by equation (30). 

It is definitely clear that quasi-stochastic systems have mi that can as- 
sume almost any positive value, while m 2 ps 0, that is these systems might 
have one of the two tails following a Zipf law. 

On the other side, stochastic systems are characterized by mi that results 
confined between and —1 and by m 2 ~ 0. This is the realm where most of 
the previous reported examples live. 

At this point, there should be no surprise in the fact that empirical data 
show both Pareto's and Zipf 's laws because the exact exponent depends on 
the ratio between the frequency of the sampling and the growth rate of the 
system, which depends ultimately on the degree of inherent stochasticity of 
the particular system considered, which is proportional to the growth rate of 
the system. 

5 Degree of Stochasticity and Heterogeneous 
Interacting Agents (HIA) 

We have seen in the previous section the existence of an internal degree 
of stochasticity associated with the parameter a. We can talk of a stochastic 
system whenever a results greater than the natural degree of stochasticity of 
the system defined by a* = y/2r. 
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Most of the systems considered in the examples of Section 1 are made up 
of agents. Agents are typically considered alike and non-interacting but in 
doing so there is a considerable lack of realism. Thanks to the new venue 
of agent-based simulation model, we can develop agents that are different 
and interacting. Agents are processes implemented on a computer that have 
autonomy (they control their own actions); social ability (they interact with 
other agents through some kind of 'language'); reactivity (they can perceive 
their environment and respond to it); and pro-activity (they are able to 
undertake goal-directed actions) [27]. 

Starting from an agent-based model, it is possible to achieve a micro- 
foundation of the system and, as a matter of facts, of its degree of stochas- 
ticity that will depend among other things by the number of agents and by 
the quality and features of the interactions among the agents. In all these 
systems, we would expect, having from one side an operational GBM system 
and from the other an observer, sampling key parameters with the consequent 
appearance of power laws. 

For this reason, it was no surprise for us to read that Delli Gatti et al. 
reported that using agents-based simulation model in which agents were af- 
fected by financial fragility observed as a consequence of business fluctuation 
the appearance of power law [5]. Similar universalities have been found by 
the author in an agent-based simulation model in which firms interact with 
other firms adjusting the prices and quantities of their products in order 
to survive (i.e. achieve sales in excess of total costs) [28]. In it, the firms 
sizes are distributed according a power law. Changing the degree of internal 
stochasticity of the simulation, by changing the level of interaction of the 
agents, the number of the agents and/or the intensity of the interaction, al- 
ters the exponent of the distribution as predicted by the model presented in 
the previous section. 

6 Conclusions 

Complex systems evolve according to stochastic models that can simply 
be modeled by a GBM equation. Together with the evolution of the system 
there is always a definite time of observation and evolution itself that must be 
considered a random variable as well. This fact changes drastically the nat- 
ural distribution function of the GBM model (i.e. a lognormal distribution) 
providing a power law at its place. 
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The exponent of the derived power distributions has several features de- 
pending both on the comparative level of internal degree of stochasticity of 
the system versus its natural degree that is proportional to the square of 
the growth rate and on the ratio between the frequency of the evolution of 
the system according to GBM and its growth rate. Thanks to these key 
parameters, we constructed a considerable versatile model, capable of great 
consistency between theory and empirical data and with strong predicting 
power. 

The final link between theory and empirical data is stressed by recent 
contributions provided by agents-based simulation models because they can 
generate in a "lab-like environment" stochastic evolutionary systems and 
micro-foundation. 

Our journey has not terminated with this paper, although we have pro- 
vided a small contribution to a deeper understanding of systems exhibiting 
stochastic evolution. These systems are all around us and represent the 
complexity of life. Economic systems, which are the object of our present 
research, have all the characteristics highlighted in the article and show some 
of the evidence found in previous equations. Even if we are yet totally uncon- 
vinced that scholars should refer to these regularities among diverse phenom- 
ena as universal laws, we have showed that it might exist a common pattern 
around which several and diverse phenomena can find an easy explanation. 

In order to continue the present effort, we will need to develop a bet- 
ter understanding of the correlation between natural and internal degrees 
of stochasticity of the system and its fundamental parts. The presence of 
heterogeneous interacting agents seem to be a powerful model to capture the 
complexity of stochastic systems. 
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